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We report the measurement of an inclusive partial branching fraction for charmless semilep- 
tonic B decay and the extraction of \V u b\- Candidates for B — > X u tv are identified using a 
novel X„-reconstruction method with simultaneous requirements on the hadronic recoil mass 
{Mx) and the invariant mass squared of lepton-neutrino system (q 2 ). Based on 86.9 fb _1 
of data taken with the Belle detector, we obtain AB(B -> X u £v;M x < 1.7 GeV/c 2 ,q 2 > 
8.0 GeV 2 /c 2 ) = (7.37±0.89(stat.)±1.12(syst.) ±0.55(6 -> c) ±0.24(6 -» u)) x 10~ 4 and deter- 
mine \V ub \ = (4.66±0.28(stat.)±0.35(syst.)±0.17(6 -> c)±0.08(6 -> ti)±0.58(theory)) x 10~ 3 . 



Introduction 



The off-diagonal element V u b in the CKM matrix is an important ingredient in overconstraining 
the unitarity triangle by measuring its sides and angles. The inclusive charmless semileptonic 
decay B — > X u iv is the most promising process to determine since \V u b\ is related to the 
total branching fraction of B —> X u lv model independently by OPE and HQET with a precision 
of about 7?PI as follows. 

|Kfc| = const x y/B(B -> X u lv){\ ± 0.07(theory)) (1) 

Experimentally we have to apply a cut on the phase space to suppress the hundred times larger 
contribution of background from B — > X c tv. The experiments on the T(45) resonance have used 
the limited region of lepton momentum E^or the hadronic recoil mass Mx^as seen in Figure^ 
The phase space of these experiments contains the so-called collinear region shown in Figure ^ 
In this region OPE fails and a shape function of the B meson is required. Therefore the need to 
extrapolate measured rates from such limited regions results in large theoretical uncertainties on 
| V u b | • A recent theoretical development suggests that one can significantly reduce the theoretical 
uncertainty on the extrapolation by applying simultaneous cuts on Mx and the invariant mass 




Figure 1: Reduction of charmed events by Ei or Mx cuts and definition of the collinear region. 

squared of the lepton-neutrino system (<? 2 )^. We compare the extrapolation errors for three 
cases assuming m I s = 4.70±0.12GeV/c 2 where m£ s is one-half of the perturbative contribution 
to the mass of the T(15). Case 1 (E t > 2.2 GeV):20%; case 2 (M x < 1.55GeV/c 2 ):18%; case 
3 (M x < 1-7 GeV/c 2 and q 2 > 8.0 GeV 2 /c 2 ):10%. There are three methods to reconstruct 
Mx and q 2 . The first one is a traditional neutrino reconstruction method used by CLEO which 
reconstructs the neutrino 4-momentum from the missing 4-momentum by requiring vanishing 
missing mass squared^. Although high reconstruction efficiency is obtained only bad Mx 
resolution is obtainable since the B momentum in CMS of the T(4S) must be neglected and 
as a result signal/noise ratio is poor. Second method is a full-reconstruction method used by 
BaBar where one of two B mesons is fully reconstructed. In this case good Mx resolution is 
obtained, but the reconstruction efficiency is poor like a few times 10 -3 '"'. The third method is 
a novel ^-reconstruction method of Belief based on a combination of neutrino reconstruction 
and a technique called simulated annealing^ to separate the two B meson decays. This method 
allows us to measure Mx and q 2 with good efficiency so that it achieves good statistical precision 
and small theoretical uncertainty with a modest integrated luminosity. We report here the first 
result with simultaneous requirements on Mx and g 2 ^4 This analysis is based on 78.1 fb _1 
data, corresponding to 85 million BB pairs, taken at the T(4S) resonance, and 8.8 fb _1 taken 
at an energy 60 MeV below the resonance, by the Belle detector at the energy-asymmetric 
e + e~ collider KEKB 9 . 

2 Experimental Procedure 

2.1 Neutrino Reconstruction 

We select hadronic events containing one lepton candidate having momentum above 1.2 GeV/c 
in CMS of the T(4S I ). We exclude events containing additional lepton candidates. The neutrino 
is reconstructed from the missing 4-momentum in the event (p u = Px(&S) ~~ ^iPii E u = Ex us) — 
YiiEi). The net observed momentum (E^pj) and energy (Sj-Ej) are calculated using particles 
surviving track quality cuts. We then compute the missing mass of the event, defined as MM 2 = 
E 2 /c A — \pu\ 2 /c 2 , where the sign of E 2 is reversed when E v < 0. The result is shown in the 
left plot of Figure|21 We require —1.5 GeV 2 /c 4 < MM 2 < 1.5 GeV 2 /c 4 to suppress events with 
missing particles and with particles removed due to poor reconstruction quality. For events that 
pass this requirement, we add back tracks and clusters rejected earlier due to reconstruction 
quality, selecting the combination that gives the smallest value of |MM 2 |. The result is shown 
in the right plot of Figure [21 This determines the set of particles that are used in the subsequent 
analysis. Events are further required to have a net charge of or ±1, and a polar angle for the 



missing momentum within the barrel region (32° < < 128°). To suppress beam-gas events, we 
demand that the net charge of all proton candidates be zero. Requiring that the cosine of the 
angle of K L candidates with respect to the missing momentum be less than 0.8 rejects events 
where the neutrino candidate is actually a meson. 




Figure 2: MM 2 distributions. The left and right plots correspond to the case with good quality tracks and the 

case including poor quality tracks, respectively. 

2.2 Simulated Annealing 

We then seek the most likely combination of particles belonging to Xlv, the remainder being 
from the associated i3-meson (B opp ). Six discriminant variables are used: the momentum, 
energy, and polar angle of B opp (p* B , E B , cos9 B ) in the CMS, its charge multiplicity (N^) 
and net charge times the lepton charge (Qg x Qi), and the missing-mass squared recalculated 
with the energy and mass of B opp constrained to the known values (MM Xiu ). Using Monte 
Carlo (MC) simulation events for T(4S) — > BB where at least one B decays into Xlv, we 
determine probability density functions (PDFs) for correct Xlv combinations and for random 
Xtv combinations. From the PDFs we calculate two likelihoods, ^(correct) and £(random). 
The most likely candidate combination in each event is found by minimizing the parameter 
W = C (random) /(£ (random) + ^(correct)). 

To minimize W, we have developed an approximate iterative algorithm based on simulated 
annealing. We start from the initial candidate for Xtv that consists of the lepton and neutrino 
plus approximately one third of the remaining particles, selected randomly. We move a random 
particle (other than the lepton or neutrino) between the X and B opp sides in an iterative way, 
where in one iteration we cross all particles at least once, and search for the combination that 
gives the minimum W with 50 iterations. During the iteration process we take special care 
to reduce the chance of convergence to a local minimum of W. For instance, after every fifth 
iteration we compare all combinations that can be constructed by crossing one particle and 
use the combination that gives the largest value of W to seed a new cycle. We repeat this 
iteration process 10 times, starting each time with a different initial candidate, and select the 
case with the smallest W. Figure El shows the distributions in three of the six discriminant 
variables and W, before and after simulated annealing. Also shown are the distributions for the 
correct combination in signal MC events. The final candidate is required to satisfy: i) W < 0.1, 
ii) 5.1 < E* B < 5.4 GeV, iii) 0.25 < p* B < 0.42 GeV/c, iv) -2 < Q e x Q B < +1, and v) 
—0.2 < MM 2 X £ U < 0.4 GeV 2 /c 4 . Contamination from the continuum is reduced by demanding 
| cos 0Bi\ < 0.8, where Obi ls the angle between the thrust axis of B opp and the lepton momentum. 

2.3 Validity Check 

The validity of the method is checked with two data samples, one containing 38,600 fully- 
reconstructed B — > D*£v decays and the other containing 84,100 B — > J/ifiX, J/ip — ► £ + £~ 




Figure 3: Distributions for 3 discriminants and W before (dashed histogram) and after (solid histogram) simulated 
annealing, for real data. MC distributions for 3 discriminants for correct combination of particles (solid curve). 



decays. For the J/ipX sample we treat one of the two leptons from J /if) as a neutrino, to 
emulate B — > Xtv. The Mx distribution for the D*£u sample is peaked at the D* mass and the 
q 2 distribution for the J/tpX sample is peaked at J/ip mass squared, as expected. The shapes 
are in good agreement between data and MC. These results verify that the simulated annealing 
method works as expected. We use the efficiency ratio r e g = 0.891 ± 0.043 between the data 
and MC for calibration of signal efficiency by MC. 

2.4 Background Subtraction 

We observe 8910 events in the X u lv "signal" region, defined as Mx < 1.7GeV/c 2 and q 2 > 
8.0GeV 2 /c 2 . These consist of semileptonic decays, B — > X c>u £u, other BB background, and 
residual continuum events. The continuum contribution is estimated from off-resonance data to 
be 251 ±48 events and is subtracted directly from the analyzed distributions. The contributions 
from B — > X c lu and other BB backgrounds are estimated via MC in the "background" region 
Mx > 1.8 GeV/c 2 , where X c lv dominates, and extrapolated to the signal region. We estimate 
them by fitting the Mx and q 2 distributions from MC events to those from the data using a 
two-dimensional \ 2 fit method. Contributions to X c lv come from D(*ku, D**lv, and D^-kIv. 
Their branching fractions are floated in the fit. The total rate for other BB backgrounds is also 
floated. The small X u lv contribution is estimated iteratively. The obtained branching fractions 
are consistent with the PDG values^. The BB background in the signal region is estimated to 
be 7283 ± 130 ± 63 events, where the first and second errors come from fit and MC statistics, 
respectively. The upper plots in Figure 0] show the Mx distribution for q 2 > 8.0 GeV 2 /c 2 and 
the q 2 distribution for Mx < 1.7 GeV/c 2 after continuum subtraction. After subtracting the 
BB backgrounds, we obtain distributions for the X u tv signal, shown in the lower plots. The 
net signal is N \, s = 1376±167 events where the error is statistical only. 

3 Extraction of Partial Branching Fraction 

In order to extract the partial branching fraction AB for B — > X u tv in the signal region, a 
Monte Carlo simulation is used to convert iV b s to the true number of signal events produced 
in this region, iV"t rue , and to estimate the efficiency for these events to be observed anywhere, 
^signal- 111 the MC simulation, B — > X u iv decays are simulated based on the prescription 
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Figure 4: (a) M x distribution for q 2 > 8.0 GeV 2 /c 2 . (b) q 2 distribution for M x < 1.7 GeV/c 2 . Points are the 
data and histograms are backgrounds from D*lv (dotted), DIv (short dashed), others (long dashed), and total 
background contribution (solid). Lower plots show the data after background subtraction. Solid curves show the 

inclusive MC predictions for B — > X u iv. 



of Fazio and Neubert^. For two parameters therein we use mj = 4.80 ± 0.12GeV/c 2 and 
p 2 = 0.30 ±0.11 GeV 2 /c 2 derived from the CLEO measurements of the hadronic mass moments 
in inclusive B —* X c li> and photon energy spectrum in B — > X S 7^JJ. Ni TUe is estimated by 
N true = AT obs x p. We find F = 0.938, and thus N truc = 1291 ± 157. The efficiency e signa i is 
predicted to be 0.578%. We determine AB by 0.5 x N tTUC /(e signai i x r e fi) / (2Nb), where r e fj is the 
efficiency correction factor described earlier, Nb is the number of BB events and the factor 0.5 
is needed to take into account the electron and muon data: 

AB = (7.37 ± 0.89 ± 1.12 ± 0.55 ± 0.24) x 10~ 4 . 

The errors are statistical, systematic, from B — * XJLv model dependence, and B —* X u iv model 
dependence, respectively. Main source of systematic uncertainty is the uncertainty in the BB 
background estimation arisen from distortion of the Mx and q 2 distributions due to imperfect 
detector simulation. The model dependence of X c lu is estimated to be 7.4% by varying the D\tv 
plus D^tv fraction in the D**lv by 25% and by varying the slope parameters of the form factors 
for DIv and D*tu, p 2 D = 1.19 ± 0.19 and p 2 = 1.51 ± 0.13 QJ within their errors. B -> X u lv 
model dependence (3.4%) is estimated by varying the parameters of the inclusive model within 
their errors and by comparing to a simulation with a full exclusive implementation of the ISGW2 
modelE21 



4 Extraction of \V u b\ 

As mentioned before AB(B — > X u £u) is related to \V UI 



EJUS] IE! 



|Vy = 0.00444 | Ag < B -y> , L55PS )' /2 (2) 
',0.002 xl.21G(<,J,„m cut ) r B J x > 



,1S N 5 



where 1.21G(q 2 nt , m cut ) = f u X yjj^vj^ J ' $ u re P resen ts the fraction of events with q 2 > q 2 ut 

and Mx < m cut . G(q 2 ut ,m cut ) is calculated to 0(a 2 ) and 0{l/m 2 ) in Ref.4, including the 
effect of the Fermi motion of the b quark, which is expressed in terms of m£ • We use ml = 



4.70 ± 0.12GeV/c 2 En21 which gives G(q 2 ut ,m cut ) = q.268 1 ^ 1 . The theoretical uncertainty on 
\V u b\ is determined only by the uncertainty on G(q^ ut , m cut ). The uncertainty on G(q 2 ut , m cut ), 
in total 25%, consists of 6% for perturbative, 8% for nonperturbative ter ms (d ominated by the 
weak annihilation contribution), and 23% from the uncertainty on raj^' ' '. The 23% error 
contains 10% for f u and 13% for (m^ s ) 5 . These uncertainties are positively correlated, so we 
add them linearly, whereas they have been given separately in conventional analyses. Using 
t b = 1.604 ± 0.012 psffl, we obtain 

\V ub \ = (4.66 ± 0.28 ± 0.35 ± 0.17 ± 0.08 ± 0.58) x 10~ 3 

where the errors are statistical, systematic, b — ► c model dependence, b — > u model dependence, 
and theoretical uncertainty for OPE, respectively. 

5 Summary 

We have performed the first measurement of \Vyj,\ with simultaneous requirements on Mx and q 2 
using a novel ^-reconstruction method. The re sult of \V u b\ = (4.66 ±0.76) x 10~ 3 is consistent 
with the previous inclusive measure ment s^ ^ ^ and the total error is comparable with those of 
the previous measurements on Y (4,5)12131 D ue to simultaneous requirements on Mx an d q 2 , the 
extrapolation error is much smaller than those of the previous measurements on T(4S)'^^. 
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